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Endogenous gene expression modification with regulatory 
element. 

5 FIELD OF INVENTION 

The present invention relates to a process for 
the modification of the expression characteristics of a 
gene which is naturally present within the genome of a 
stable cell line or cloned microorganism. In the preferred 

10 embodiment , the present invention relates to a process for 
the activation and expression of a gene that is present 
within a stable cell line and normally transcriptionally 
silent or inert* As a result, the protein product of that 
gene is expressed. This phenomenon occurs without 

15 transf ecting the cell with the DNA that encodes the 

product. Rather, the resident gene coding for the desired 
product is identified within a cell and activated by 
inserting an appropriate regulatory segment through a 
technique called homologous recombination. Positive and/or 

20 negative selectable markers can also be inserted to aid in 
selection of the cells in which proper homologous 
recombination events have occurred. As an additional 
embodiment, a specified gene can be amplified for enhanced 
expression rates, whether that gene' is normally 
25 transcriptionally silent and has been activated by means of 
the present invention, or endogenously expresses product. 

BACKGROUND OF THE INVENTION 
It is well known that each cell within an 
30 organism contains the genetic information that encodes all 
of the proteins found within that organism. However, only 
a very small percentage of the genes present within a givien 
cell type is actually transcribed. The intracellular 
mechanisms that regulate the array of genes to be 
35 transcribed are now understood. C 11 specif ic prot ins 
present within the nucleus interact with DNA regulatory 
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Segments that are linked with particular genes. This 
interaction of nuclear proteins with DNA regulatory 
sequences is required for gene transcription. This results 
in mRNA biosynthesis and ultimate expression of the encoded 
5 protein (Mitchell and Tjian, Science , 245:371,1989). 

These DNA regulatory segments or elements for 
each gene lie upstream from and, in some cases, within or 
even downstream of the coding regions. Through an 
interaction with cell specific nuclear proteins, DNA 

10 regulatory segments affect the ability of RNA polymerase, 
the rate limiting enzyme in protein expression, to gain 
access to the body of the gene and synthesize a mRNA 
transcript. Thus, these DNA segments and the resident 
nuclear proteins play a critical role in the regulation of 

15 expression of specific genes (Johnson and McKnight, Ann. 
Rev. Biochem. . 58:799, 1989). 

The DNA regulatory segments are binding sites for 
the nuclear proteins. These nuclear proteins attach to the 
DNA helix and apparently alter its structure to make the 

20 desired gene available for RNA polymerase recognition, 

which facilitates gene transcription. The expression of 
these cell specific regulatory proteins determines which 
genes will be transcribed within a cell and the rate at 
which this expression will occur. As an example" of * the 

25 specificity of this system, pituitary cells but not liver 
cells express pituitary proteins, even though the genes for 
the pituitary proteins are present within all liver cells. 
Nuclei of the liver cells do not contain the specific DNA 
binding proteins which interact with the elements of 

30 pituitary genes resident within the liver cells. 

Current Methods Employed to Express Proteins Using 
Recombinant DNA Technology 

With the knowledge that specific DNA regulatory 
35 sequences are required to activate gene transcription 

within a particular cell type, scientists have expressed 
foreign genes within a particular cell type through genetic 
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engineering. In general, DNA regulatory segments that are 
recognized by the cell's nuclear proteins are placed 
upstream from the coding region of a foreign gene to be 
expressed. In this way, after insertion into the cell, 
5 foreign DNA may be expressed since the cell'.s nuclear 
regulatory proteins now recognize these DNA regulatory 
sequences. This technology has been employed to produce 
proteins that have been difficult to obtain or purify from 
natural sources by traditional purification strategies • 
10 in addition to the recognizable DNA sequences and 

the gene of interest, a selectable marker is attached to 
the DNA construction. In this way, only the cells that 
have taken up the DNA survive following culture in a 
selectable medium. For example, the gene for neomycin 
15 resistance may be included in the expression vector. 
Following transfection, cells are cultured in G418, a 
neomycin antibiotic that is lethal to mammalian cells. If 
however, the cells have acquired the neomycin resistance 
gene, they will be able to withstand the toxic effects of 
20 the drug. In this way, only the cells that have taken up 
the transfected DNA are maintained in culture. It is 
understood that any selectable marker could be used as long 
as it provided for selection of cells that had taken up the 
transfected DNA. It is further understood that there is no 
25 criticality as to the specific location of the inserted 

genetic material within the cell. It is only important that 
it be taken up somewhere within the nucleus as both the 
regulatory segment and the foreign gene (as well as the 
selectable marker) are inserted together. 

30 

Deficiencies in the Current Methods of Gene Expression 

While the above techniques have been instrumental 
in exploiting the power of genetic engineering, they have 
not always been the most efficient methods to express 
35 genes. This is due to the fact that insertion of DNA into 
the nucleus of a cell line is usually accomplished through 
a technique known as transfecti n. DNA that has been 
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engineered for expression in the cell line of interest is 
precipitated and the cell membrane is solubilized to allow 
entry of the DNA. As indicated above, the exact site into 
which the DNA incorporates into the genome is never 
5 predictable; indeed the DNA may remain episomal (not 
integrated into the genome) . This results in the 
unpredictability of both the level of expression of the 
protein produced and the stability of the cell line. 

A second shortcoming of this technique is the 
10 fact that the construction of the expression vector is 

extremely difficult when the gene of interest is relatively 
large (greater than 5-10 kilobases) . Many of the proteins 
expressed by recombinant DNA technology have been encoded 
by cDNAs rather than much larger genomic clones. This is 
15 done to reduce the overall size of the insert. While the 
use of cDNAs makes genetic engineering more convenient, 
rates of gene transcription and protein production may 
suffer as a result. It has recently been shown that 
expression levels are sometimes greatly enhanced through 
the use of genomic rather than cDNA inserts (Brinster et 

al *» Proc. Natl. — Acadj Sci. . 85:836-840, 1988, and Chung 

and Perry, Mol. Cell . Biol. . 9:2075- 2082, 1989). 
Although the mechanisms responsible for this observation 
are not well understood, it is known that in certain 
situations enhancer elements present within introns can 
improve the transcriptional efficiency of the gene. There 
is also evidence that introns, or the splicing events which 
result from the presence of introns, may have an effect on 
the RNA processing events which follow the initiation of 
transcription (Buchman and Berg, Mol. Cell. Biol . r 8:4395- 
4405, 1988). This may stabilize the transcript thereby 
improving the rate of mHNA accumulation. In the above 
cited Brinster et al paper, it is also postulated that the 
position of the introns within the gene may be important 
for phasing of nucleosomes relative to the promoter. The 
influence of various regulatory elements on transcription 
of eukaryotic genes is discussed in Khoury et al, Cell . 



20 



25 



30 



35 
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15 



33:313-14 (1983), Maniatis et al, Science . 236:1237-45 

(1987) and Muller et al, Eur. J. Biochem. . 176:485-95 

(1988) . 

Thirdly, to gain entry into the nucleus, the 
5 transfected DNA, including the entire coding region of the 
foreign gene, must traverse the cytoplasm following entry 
through the permeabilized plasma membrane of the cell. 
During that time, the DNA may come in contact with 
lysosomal enzymes which may alter or completely destroy the 
10 integrity of the DNA. Thus, the coding region of the DNA 
may not be identical to that which was transfected. 

The novel method of gene activation and/or 
expression modification that we describe below cannot 
result in the production of mutant forms of the desired 
protein, since the coding region of the desired gene is not 
subjected to enzymatic modifications. 

In summary, a large amount of the DNA transfected 
into the cell using traditional techniques, and 
particularly the coding region thereof, will not be 
2 0 faithfully transcribed. It may be degraded prior to entry 
into the nucleus, enzymatically perturbed so that it will 
not encode the entire desired protein or it may not contain 
all of the necessary regulatory segments to allow for 
transcription. It may be inserted into a portion of the 
25 genome that prevents transcription. If the cDNA is 

transcribed, the protein of interest may not be produced 
efficiently due to the omission of introns which may 
contain enhancers or enable efficient mENA processing. 
Finally, it may remain episomal, promote protein production 
30 but be unstable as the cell population grows through cell 
division. 

It would be most desirable to develop a method of 
induction of gene expression that would produce a cell line 
that has incorporated the positive attributes of the 
35 existing methods but somehow circumvents the unattractive 
features. It would further be desirable to be able to 
express r modify endogenous expression of particular genes 
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in the cell type of choice, it is further desired to be 
able to take advantage of the potential benefits that may 
be afforded by a complete genomic sequence which may 
include cryptic transcriptional enhancers that may reside 
within introns, by appropriate placement of introns for 
proper nucleosome phasing or by more efficient mRNA 
processing events. These advantages are ordinarily not 
enjoyed in recombinant DNA expression methods due to the 
size of the gene. If one were able to express a gene that 
is already resident in the genome, i.e., an endogenous 
gene, cell line stability and expression rates would become 
more consistent and predictable. 

SUMMARY OF THE INVENTION 
Accordingly, it is an object of the present 
invention to eliminate the above-noted deficiencies in the 
prior art. 

It is another object of the present invention to 
provide a method of regulation and/or amplification of gene 
expression that incorporates the positive attributes of 
recombinant gene technology but circumvents the 
unattractive features. 

It is a further object of the present invention 
to provide a method for expressing specific genes present 
but normally transcriptionally silent in a cell line of 
choice. 

It is yet a further object of the present 
invention to provide a method for expressing proteins which 
takes full advantage of complete genomic sequences that are 
responsible for mRNA accumulation and/or transcription. 

It is still another object of the present 
invention to provide a method of modifying the expression 
characteristics of a gene of interest by inserting DNA 
regulatory segments and/ or amplifying segments into the 
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genome of a stable cell line or cloned microorganism 
upstream of, within, or otherwise proximal to the native 
gene of interest. 

It is still a further object of the present 
5 invention to provide a method for modifying the expression 
characteristics of a gene which is naturally present within 
the genome of a stable cell line or cloned microorganism 
and at the same time insert characteristics which will aid 
in the selection of cells which have been properly 
10 modified. 

It is yet another object of the present invention 
to provide a genome having therein, proximal to the coding 
region or exons of a gene of interest, a regulatory or 
amplifying segment which does not naturally appear 
15 thereat. 

It is another object of the present invention to 
provide DNA constructs which can be used for accomplishing 
the homologous recombination methods of the present 
invention. 

20 It is a further object of the present invention 

to provide cell lines and microorganisms which include the 
genomes in accordance with the present invention. 

These and other objects of the present invention 
are accomplished by means of the technique of homologous 
25 recombination, by which one of ordinary skill in this art 
can cause the expression and, preferably, amplification of 
resident, albeit transcriptionally silent genes. By this 
technique, one can also modify the expression 
characteristics of a gene which is naturally present, but 
30 not necessarily silent or inert, within the genome of a 
stable cell line, such as, for example, to make the 
expression conditional, i.e. , reptessible or inducible, or 
to enhance the rate of expression. 

The present invention provides a method of 
35 modifying the expression characteristics of a gene within 
the genome of a cell line or microorganism. A DNA 
construct is inserted into that gen me by the technique of 
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homologous recombination. The construct includes a DNA 
regulatory segment capable of modifying the expression 
characteristics of any gene to which it is operative ly 
linked within the host cell line or microorganism, as well 
5 as a targeting segment homologous to a region of the genome 
at which it is desired for the DNA regulatory segment to be 
inserted. The construct and insertion technique is 
designed to cause the new DNA regulatory segment to be 
operatively linked to the gene of interest. Thus, without 
10 necessarily inserting any new coding exons, the expression 
characteristics of that gene are modified. In the 
preferred embodiment, the gene is one which is normally 
transcriptionally silent or inert within the host cell line 
or microorganism and, by means of the DNA regulatory 
15 region, which is targeted directly to the appropriate 

position with respect to that gene by means of homologous 
recombination, that gene is thereby activated for 
expression of its gene product. 

The DNA construct preferably includes two 
20 targeting segments which, while separated from one another 
in the construct by those elements to be inserted into the 
genome, are preferably contiguous in the native gene. 

The construct further preferably includes at 
least one expressible selectable marker gene, such as the 
25 gene providing neomycin resistance. This marker gene, 

including a promoter therefor, is also disposed between the 
two targeting regions of the construct. 

In another embodiment, the construct includes an 
expressible amplifiable gene in order to amplify expression 
30 of the gene of interest. This gene, including a promoter 
therefor, is also disposed between the two targeting 
regions. of the construct. In some cases the selectable 
marker and the amplifiable marker may be the same. 

In a further embodiment of the present invention, 
35 the DNA construct includes a negative selectable marker 
gene which is not expressed in cells in which the DNA 
construct is pr perly inserted. This negative selectable 
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marker gene is disposed outside of the two targeting 
regions so as to be removed when the construct is properly 
combined into the gene by homologous recombination. An 
example of such a negative selectable marker gene is the 
5 Herpes Simplex Virus thymidine kinase gene. 

In yet a further embodiment, it is possible to 
modify the expression characteristics of a specific gene 
which already expresses a product in the cell line or 
microorganism of interest. This can be accomplished by 
10 inserting by homologous recombination a DNA construct which 
includes (1) an expressible amplifiable gene which 
increases the copy number of the gene of interest when the 
cell line or microorganism is subjected to amplification 
conditions and/or (2) a promoter/ enhancer element (or other 
15 regulatory element) which modifies the expression of the 
gene of interest such as, for example, by increasing the 
rate of transcription, increasing translation efficiency, 
increasing mRNA accumulation, making the expression 
inducible, etc. The gene expression which is modified in 
20 this manner may be natural expression or expression which 
has been caused by previous genetic manipulation of the 
cell line or microorganism. The previous genetic 
manipulation may have been by conventional techniques or by 
means of homologous recombination in accordance with the 
25 present invention. In the latter case, the DNA insertion 
which results in the modification of expression 
characteristics may be accomplished as part of the same 
genetic manipulation which results in expression of the 
gene or may be performed as a subsequent step. 
30 The present invention also includes the 

constructs prepared in accordance with the above discussion 
as well as the genomes which have been properly subjected 
to homologous recombination by means of such constructs and 
the cell lines and microorganisms including these genomes. 

35 
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Moreover, a process for preparation of the desired product 
by culturing the transformed cells according to the present 
invention is also included. 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a general outline of a DNA construct 
in accordance with the present invention. 

Fig. 2A shows the mode of integration of the DNA 
construct into the genome in the event of non-homologous or 
10 random recombination. 

Fig. 2B shows the mode of integration of the DNA 
construct in the genome in the event of homologous 
recombination • 

Fig. 3 shows the construction of a preferred 
15 homologous recombination vector in accordance with the 
present invention. 

Fig. 4 shows the mode of integration of a 
circular piece of DNA by homologous recombination when only 
a single targeting piece of DNA is employed. 
20 Fig. 5 shows the pRSVCAT plasmid, including the 

restriction sites thereof ♦ 

Fig. 6 shows the construction of the pRSV 
plasmid, including the restriction sites thereof. 

Fig. 7 shows the pSV2NE0' plasmid, including the 
25 restriction sites thereof. 

Fig. 8 shows the construction of the pSVNEOBAM 
plasmid, including the restriction sites thereof. 

Fig. 9 shows the construction of the pRSVNEO 
plasmid, including the restriction sites thereof. 
30 Fi 9- 10 shows the construction of the pRSVCATNEO 

plasmid, including the restriction sites thereof. 

Fig. 11 shows a 15.3 kb fragment of the rat TSHB 
gene and showing various restriction segments thereof. 
Fig. 12 shows the construction of the 
35 pRSVCATNEOTSHB3 plasmid, including the restriction sites 
thereof. 



WO91/09W5 



PCT/US90/07642 



- 11 - 

Fig. 13 shows the construction of the 
pRSVCATNEOTSHB3-5XbaI plasmid, including the restriction 
sites thereof. 

Fig. 14 shows a portion of the nucleotide 
5 sequence of TSH£ along with the regions thereof to which 
each primer for PCR amplification corresponds* Exons 2 and 
3 are shown in capital letters. A 247 BP amplified 
fragment is shown by underlined asterisks. 

Fig. 1 15 shows the results of polyacrylamide gel 
10. electrophoresis of cDNA synthesized from RNA extracted from 
various cell populations and whose TSHfi cDNA, if present, 
has been amplified by PCR. The nature of the cells 
representing the various lanes is set forth in Fig. 15 
below the gel. 

15 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Homologous recombination is a technique developed 
within the past few years for targeting genes to induce or 
correct mutations in transcriptionally active genes 
20 (Kucherlapati, Proa, in Nucl. Acid Res, and Mol. Biol. . 

36:301 (1989)). This technique of homologous recombination 
was developed as a method for introduction of specific 
mutations into specific regions of the mammalian genome 
(Thomas et al., Cell , 44:419-428, 1986; Thomas and" 
25 Capecchi, Cell . 51:503-512, 1987; Doetschman et al. , Proc. 
Natl. Acad. Sci. , 85:8583-8587, 1988) or to correct 
specific mutations within defective genes (Doetschman et 
al., Nature . 330:576-578, 1987). 

Through this technique, a piece of DNA that one 
30 desires to be inserted into the genome can be directed to a 
specific region of the gene of interest by attaching it to 
"targeting DNA". "Targeting DNA" is DNA that is 
complementary (homologous) to a region of the genomic DNA. 
If two homologous pieces of single stranded DNA (i.e., the 
35 targeting DNA and the genomic DNA) are in close proximity, 
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they will hybridize to form a double stranded helix. 
Attached to the targeting DNA is the DNA sequence that one 
desires to insert into the genome. 

There are a number of methods by which homologous 
recombination can occur. One example is during the process 
of replication of DNA during mitosis in cells. 

Through a mechanism that is not completely 
understood, parental double-stranded DNA is opened 
immediately prior to cell division at a local region called 
the replication bubble. The two separated strands of DNA 
may now serve as templates from which new strands of DNA 
are synthesized. One arm of the replication fork has the 
DNA code in the 5' to 3' direction, which is the 
appropriate orientation from which the enzyme DNA 
polymerase can "read". This enzyme attaches to the 5» 
portion of the single stranded DNA and using the strand as 
a template, begins to synthesize the complementary DNA 
strand. The other parental strand of DNA is encoded in the 
3' to 5« direction. It cannot be read in this direction by 
DNA polymerase. For this strand of DNA to replicate, a 
special mechanism must occur. 

A specialized enzyme, RNA primase, attaches 
itself to the 3' to 5' strand of DNA and synthesizes a 
short RNA primer at intervals along the strand. Using 
these RNA segments as primers, the DNA polymerase now 
attaches to the primed DNA and synthesizes a complementary 
piece of DNA in the 5' to 3' direction. These pieces of 
newly synthesized DNA are called Okazaki fragments . The 
RNA primers that were responsible for starting the entire 
reaction are removed by the exonuclease function of the DNA 
polymerase and replaced with DNA. This phenomenon 
continues until the polymerase reaches an unprimed stretch 
of DNA, where the local synthetic process stops. Thus, 
although the complementary parental strand is synthesized 
overall in the 3' to 5 • direction, it is actually produced 
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by "backstitching" in the 5' to 3' direction. Any nicks 
that might occur in the DNA during the '"backstitching" 
process are sealed by an enzyme called DNA ligase. 

To maintain an absolute fidelity of the DNA code, 
5 a proofreading function is present within the DNA 

polymerase* The DNA polymerase requires primed pieces of 
DNA upon which to synthesize a new strand of DNA. As 
mentioned above, this can be a single strand of DNA primed 
with RNA, or a complementary strand of DNA. When the DNA 
10 polymerase finds mismatched complementary pieces of DNA, it 
can act as an exonuclease and remove DNA bases in a 3' to 
5 1 direction until it reaches perfect matching again. 

With this background, it is now possible to 
understand the basis of the technique described herein. 
15 Small pieces of targeting DNA that are complementary to a 
specific region of the genome are put in contact with the 
parental strand during the DNA replication process. It is 
a general property of DNA that has been inserted into a 
cell to hybridize and therefore recombine with other pieces 
20 of DNA through shared homologous regions. If this 

complementary strand is attached to an oligonucleotide that 
contains a mutation or a different sequence of DNA, it too 
is incorporated into the newly synthesized strand as a 
result of the recombination. As a result of the proof - 
25 reading function, it is possible for the new sequence of 

DNA to serve as the template. Thus, the transfected DNA is 
incorporated into the genome. 

If the sequence of a. particular gene is known, a 
piece of DNA that is complementary to a selected region of 
30 the gene can be synthesized or otherwise obtained, such as 
by appropriate restriction of the native DNA at specific 
recognition sites bounding the region of interest. This 
piece will act as a targeting device upon insertion into 
the cell and will hybridize to its homologous region within 
35 the genome. If this hybridization occurs during DNA 

replication, this piece of DNA, and any additional sequence 
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attached thereto, will act as an Okazaki jfragment and will 
be backs titched into the newly synthesized daughter strand 
of DNA. 

In the technique of the present invention, 
5 attached to these pieces of targeting DNA are regions of 
DNA that are known to interact with the nuclear regulatory 
proteins present within the cell and, optionally, 
amplifiable and selectable DNA markers. Thus, the 
expression of specific proteins may be achieved not by 

10 transf ection of DNA that encodes the gene itself and marker 
DNA, as is most common, but rather by the use of targeting 
• DNA (regions of homology with the endogenous gene of 
interest) coupled with DNA regulatory segments that provide 
the gene with recognizable signals for transcription. With 

15 this technology, it is possible to express and to amplify 
any cognate gene present within a cell type without 
actually transfecting that gene. In addition, the 
expression of this gene is controlled by the entire genomic 
DNA rather than portions of the gene or the cDNA, thus 

20 improving the rate of transcription and efficiency of mRNA 
processing. Furthermore, the expression characteristics of 
any cognate gene present within a cell type can be modified 
by appropriate insertion of DNA regulatory segments and 
without inserting" entire coding portions of the gene of 

25 interest. 

In accordance with these aspects of the instant 
invention there are provided new methods for expressing 
normally transcriptionally silent genes of interest, or for 
modifying the expression of endogenously expressing genes 

30 of interest, within a differentiated cell line. The 

cognate genomic sequences that are desired to be expressed, 
or to have their expression modified, will be provided with 
the necessary cell specific DNA sequences (regulatory 
and/or amplification segments) to direct or modify 

35 expression of the gene within the cell. The resulting DNA 
will comprise the DNA sequence coding for the desired 
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protein directly linked in an operative way to heterologous 
(for the cognate DNA sequence) regulatory and/or 
amplification segments i A positive selectable marker is 
optionally included within the construction to facilitate 
5 the screening of resultant cells. The use of the neomycin 
resistance gene is preferred, although any selectable 
marker may be employed. Negative selectable markers may, 
optionally, also be employed. For instance, the Herpes 
Simplex Virus thymidine kinase (HSVtk) gene may be used as 
10 a marker to select against randomly integrated vector DNA. 
The fused DNAs, or existing expressing DNAs, can be 
amplified if the targeting DNA is linked to an amplifiable 
marker. 

Therefore, in accordance with the method of the 
15 present invention, any gene which is normally expressed 
when present in its specific eukaryotic cell line, 
particularly a differentiated cell line, can be forced to 
expression in a cell line not specific for it wherein the 
gene is in a silent format. This occurs without actually 
20 inserting the full DNA sequence for that gene. In 

addition, that gene, or a normally expressing gene, can be 
amplified for enhanced expression rates. Furthermore, the 
expression characteristics of genes not totally 
transcriptionally silent can be modified as can the 
25 expression characteristics of genes in microorganisms. 

In one embodiment of the present invention, 
eukaryotic cells that contain but do not normally 
transcribe a specific gene of interest are induced to do so 
by the technique described herein. The homologous 
30 recombination vector described below is inserted into a 
clonal cell line and, following chemical selection, is 
monitored for production of a specific gene product by any 
appropriate means, such as, for example, by detection of 
mRNA transcribed from the newly activated gene, 
35 immunological detection of the specific gene product, or 
functional assay for the specific gene product. 
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The general outline of the DNA construct that is 
used to transcriptionally activate endogenous genes by 
homologous recombination is depicted in Figure l. 

In general, the DNA construct comprises at least 
two and up to six or more separate DNA segments. The 
segments consist of at least one, preferably two, DNA 
targeting segments (A and B) homologous to a region of the 
cell genome within or proximal to the gene desired to be 
expressed, a positive selection gene (C) , an amplif iable 
gene (D) , a negative selection gene (E) and a DNA 
regulatory segment (p) which is transcriptionally active in 
the cell to be transfected. In the most basic embodiment- 
of the present invention, only a single targeting segment 
(B) and the regulatory segment (F) must be present. All of 
15 the other regions are optional and produce preferred 
constructs. 

Regions A and B are DNA sequences which are 
homologous to regions of the endogenous gene of interest 
which is to be made transcriptionally active. The specific 
regions A and B of the endogenous gene are selected so as 
to be upstream and downstream, respectively, of the 
specific position at which it is desired for the regulatory 
segment to be inserted. Although these regions are 
separated in the construct they are preferably contiguous 
in the endogenous gene. There may be occasions where non- 
contiguous portions of the genome are utilized as targeting 
segments, for example, where it is desired to delete a 
portion of the genome, such as a negative regulatory 
element. 
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While two targeting regions, A and B, are 
preferred in order to increase the total regions of 
homology and thus increase recombination efficiency, the 
process of the present invention also comprehends the use 
of only a single targeting region, in its simplest form 
(when only the regulatory segment F and the selectable 
marker gene C and promoter C are to be inserted), a 
circular piece of DNA is employed which contains these 
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elements along with the targeting DNA (see Figure 4). in 
this way, the homologous region (B) hybridizes with its 
genomic counterpart. Segments C, C and F are inserted 
within the B portion of the cognate gene following the 
5 crossover event. 

When it is desired for the DNA regulatory 
sequence to be inserted upstream of the gene of interest, 
as, for example, when it is desired to activate and express 
a normally transcriptionally silent gene, the region of 
10 homology is preferably homologous to a non-coding portion 
of the genome upstream of the coding portions of the gene 
of interest. When two targeting regions are present, the 
downstream region (A) may include a portion of the coding 
region, although it is preferred that it, too, be totally 
i5 upstream of the coding region. It is further preferred 
that the homologous regions be chosen such that the DNA 
regulatory sequence will be inserted downstream of the 
native promoter for the gene of interest, particularly if 
the native promoter is a negative promoter rather than a 
20 turned-off positive promoter. 

The size of the targeting regions, i.e., the 
regions of homology, is not critical, although the shorter 
the regions the less likely that they will find the 
appropriate regions of homology and recombine at the 
25 desired spot. Thus, the shorter the regions of homology, 
the less efficient is the homologous recombination, i.e., 
the smaller the percentage of successfully recombined 
clones. It has been suggested that the minimum requirement 
for sequence homology is 25 base pairs (Ayares et al, pnas. 
30 USA., 83:5199-5203, 1986). Furthermore, if any of the other 
elements of the construct are also found in the genome of 
the host cell, there is a possibility of recombination at • 
the wrong place. However, in view of the excellent 
positive and negative selectability of the present 
35 invention, it can be successfully practiced even if the 
efficiency is low. The optimum results are achieved when 
the total region of homology, including both targeting 
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regions, is large, for example one to three kilobases. As 
long as the regulatable segment F can be operatively linked 
to the gene of interest there is no limit to the size of 
the targeting region, and particularly the upstream 
targeting region B. 

It can easily be empirically determined whether 
or not the targeting regions are too large or the 
regulatable segment F spaced too far from the coding region 
of the gene to be operatively linked thereto, m such a 
case, the regions A and B can be made homologous to a 
different section of the gene of interest and the process 
repeated until the regulatable segment F is properly 
inserted so as to be operatively linked to the gene of 
interest. For example, the restriction site of combined 
region A-B of the endogenous gene can be changed and the 
process repeated, once the concept of the present 
invention is known, along with the techniques disclosed 
herein, one of ordinary skill in this art would be able to 
make and use the present invention with respect to any 
given gene of interest in any cell line or microorganism 
without use of undue experimentation. 

Region C is a positive selectable marker gene 
which is capable of rendering the transfected cell line 
resistant to a normally toxic environment. Examples of 
such genes are adenosine deaminase (ADA) , aminoglycoside 
phosphotransferase (neo) , dihydrof olate reductase (DHFR) , 
hygromycin-B-phosphotransf erase (HP-H) , thymidine kinase ' 
(tk) , xanthine-guanine phosphoribosyltransf erase (gpt) , 
multiple drug resistance gene (MDR) , ornithine 
decarboxylase (ODC) and N 7 (phosphonacetyl) -L-aspartate 
resistance (CAD) . 

In addition to the positive selectable marker 
gene, an amplifiable gene is also optionally included in 
the construct at region D. Amplifiable genes are genes 
that lead to an increase in copy number when under 
selective pressure. The copy number of a gene positioned 
adjacent to the amplifiable gene will also increase. 
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Amplifiable genes that can be utilized include DHFR, MDR, 
ODC, ADA and CAD. The members of the positive selectable 
marker gene group and those of the amplifiable gene group 
overlap so that, in theory, instead of using two genes, one 
5 for positive selection and one for amplification, one gene 
could be used for both purposes. However, since most cell 
lines contain endogenous copies of these amplifiable genes, 
the cells will already be somewhat resistant to the 
selection conditions and distinguishing the cells which 
10 have transfected DNA from those which do not receive 

transfected DNA can be difficult. Thus, in instances where 
an amplifiable gene is desired, a positive selection gene 
which is dominant, such as HPH, gpt, neo and tk (in tk- 
cells) , should also be included in the construct. For some 
15 applications it may be possible or preferable to omit the 

amplifiable marker. For instance, the gene of interest may 
not need to be amplified as, for example, when 
transcriptional activation by the heterologous DNA 
regulatory sequence is sufficient without amplification. 
20 Also, if the homologous recombination efficiency is very 

low, it may be necessary to leave out the amplifiable gene 
since the ratio of non-homologous DNA to homologous DNA is 
directly related to the homologous recombination efficiency 
(Letsou, Genetics, 117:759-770," 1987). It is also* possible 
25 to eliminate the positive selection gene and select cells 
solely by screening for the production of the desired 
protein or mRNA. However, it is preferred in most cases to 
include at least the positive selection gene. 

Region E of the construct is a negative 
30 selectable marker gene. Such a gene is not expressed in 
cells in which the DNA construct is properly inserted by 
homologous recombination, but is expressed in cells in 
which the DNA construct is inserted improperly, such as by 
random integration. One such gene is the Herpes Simplex 
35 Virus thymidine kinase gene (HSVtk) . The HSVtk has a lower 
stringency for nucleotides and is able to phosphorylate 



WO 91/09955 



PCT/US90/07642 



- 20 - 



nucleotide analogs that normal mammalian cells are unable 
to phosphorylate. If the HSVtk is present in the cells, 
nucleotide analogs such as acyclovir and gancyclovir are 
phosphorylated and incorporated into the DNA of the host 
5 cell thus killing the cell. The presence of the negative 
selectable marker gene enables one to use the positive- 
negative selection for homologous recombination as 
described by Mansour et al ( Nature . 336:348-352, 1988). 
Capecchi uses a strategy which takes advantage of the 

10 differing modes of integration that occur when linearized 
vector DNA inserts via homologous recombination as compared 
to when it inserts by random integration. - If the vector 
DNA inserts randomly, the majority of the inserts will 
insert via the ends (Folger et al, Mol. Cell. Biol . . 

15 2:1372-1387, 1982; Roth et al, Mol. Cell. »<m. f 5:2599- 

2607, 1985; and Thomas et al, £eH, 44:419-428, 1986). On 
the other hand, if the vector inserts by homologous 
recombination, it will recombine through the regions of 
homology which cause the loss of sequences outside of those 

20 regions. 

Using the construct depicted in Figure l as an 
example, the mode of integration for homologous 
recombination versus random integration is illustrated in 
Figures 2A and 2B. In the case of non-homologous 

25 recombination (Figure 2A) , the vector is inserted via the 
ends of the construct. This allows region E, in this case 
the HSVtk gene, to be inserted into the genome. However, 
when homologous recombination occurs (Figure 2B) , the HSVtk 
gene is lost. The first round of selection uses the 

30 appropriate drug or conditions for the positive selection 
present within the construct. Cells which have DNA 
integrated either by homologous recombination or random 
integration will survive this round of selection. The 
surviving cells are then exposed to a drug such as 

35 gancyclovir which will kill all the cells that contain the 
HSVtk gene. In this case, most of the cells in which the 
vector integrated via a rand m insertion contain the HSVtk 
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gene and are killed by the drug while those in which the 
vector integrated by homologous recombination have lost the 
HSVtk gene and survive.' This allows the elimination of 
most of the cells which contain randomly integrated DNA, 
5 leaving the majority of the surviving cells containing DNA 
which integrated via homologous recombination. This 
greatly facilitates identification of the correct 
recombination event. 

The negative selection step can also be 
10 eliminated if necessary. It will require that the 

screening step be more labor intensive involving the need 
for techniques such as polymerase chain reaction (PCR) or 
immunological screening. 

The sixth region (F) contains the DNA regulatory 
15 segment that will be used to make the gene of interest 

transcriptionally active. The appropriate DNA regulatory 
segment is selected depending upon the cell type to be 
used. The regulatory segment preferably used is one which 
is known to promote expression of a given gene in 
20 differentiated host cell line. For example, if the host 
cell line consists of pituitary cells which naturally 
express proteins such as growth hormone and prolactin, the 
promoter for either of these genes can be used as DNA 
regulatory element F. When inserted in accordance with the 
25 present invention, the regulatory segment will be 

operatively linked to the normally transcriptionally silent 
gene of interest and will stimulate the transcription 
and/or expression of that gene in the host cell line. Also 
usable are promiscuous DNA regulatory segments that work 
across cell types, such as the rous sarcoma virus (RSV) 
promoter. As long as the regulatory segment stimulates 
transcription and/or expression, or can be induced to 
stimulate transcription and/or expression, of the gene of 
interest after being inserted into the host cell line so as 
35 to be operatively linked to the gene of interest by means 
of the present invention, it can be used in the present 
invention. It is important when joining the regulatory 
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segment F to the targeting segment A that no starting codon 
be accidentally introduced into the sequence since such an 
occurrence could alter the reading frame of the gene which 
is desired to be expressed. Of course, the construct must 
be constructed and inserted such that the regulatory 
segment F is operatively linked to the gene of interest. 

The DNA regulatory segment, region F, need not be 
present in instances where it is desired to enhance or 
amplify the transcription of a gene which is already 
expressing in the cell line of interest, either because it 
naturally expresses in that cell line or because the cell 
line has previously had its DNA manipulated to cause such 
expression. In such instances, insertion of an amplifiable 
gene, region D, preferably with the positive selectable 
!5 marker gene, region C, and optionally also with a negative 
selectable marker gene, region E, will be sufficient to 
increase the copy number of the gene of interest and thus 
enhance the overall amount of transcription. 
Alternatively, a new regulatory segment, region F, 
inherently promoting an increased (or otherwise modified) 
rate of transcription as compared to the existing 
regulatory region for the gene of interest, may be included 
to further enhance the transcription of the existing 
expressing gene of interest. Such a new regulatory segment 
could include promoters or enhancers which improve 
transcription efficiency. 

Regions C, D* and E» are promoter regions which 
are used to drive the genes in regions C, D, and E, 
respectively. These promoters are transcriptionally active 
in the cell line chosen and may be the same or different 
from the promoter in region F used to drive the endogenous 
gene of interest. The specific direction of transcription 
specified in Fig. 1 is not critical. Those of ordinary 
skill in this art can determine any appropriate placement 
35 of the genes C, D and E and their promoters C, D' and E» 
such that the pr moters will stimulate expression of their 
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associated genes without simultaneously disrupting in any 
way the expression of the gene of interest or any of the 
other genes of the construct. 

The present invention may be illustrated by 
5 reference to the activation of the rat thyrotropin beta 

subunit (TSHB) in GH, (ATCC CCL 82) , GH 3 (ATCC CCL 82.1) or 
GH 4 C1 cell lines (GH) . GH cell lines are derived from a 
radiation induced pituitary tumor in rats designated MtT/W5 
(Takemoto, Cancer Res, r 22:917, 1962) and adapted to grow 
10 in culture by Tashjian et al. Endocrinology . 82:342-352, 
1968. These cell lines may be subcloned and screened for 
■ their ability to produce growth hormone and TSH6 . Such 
screening may preferably be by means of Northern blot 
analysis to determine whether mRNA for the rat growth 
15 hormone gene is present and to establish that there is no 

mRNA for the TSHB gene being produced. The cell lines may 
also be screened by Southern analysis to determine that 
there is at least one copy of the TSHB gene present within 
the genome. Only the GH cell lines that produce growth 
20 hormone and not TSHB, but contain a copy of the TSHB gene, 
are used. 

The specific homologous recombination vector for 
use in GH cells may be designed in the following manner 
(Figure "3). Region A may consist of the 5» upstream 

25 untranslated region of the TSHB* gene defined by the Hindlll 
fragment which stretches from -74 to -2785 and region B may 
contain the DNA fragment that stretches from the -2785 
Hindlll site to a Ncol site approximately 2.1 kb further 
upstream as described by Carr et al ( J. Biol, Chem. r 

30 262:981-987, 1987) and Croyle et al ( DNA . 5:299-304, 1986). 
The positive selection gene (region C) may be a 1067 bp 
Bglll-Smal fragment derived from the plasmid pSV2neo (ATCC 
No. 37,149) (Southern et al, J, Mol. AppI. Gen. . 1:327- 
341, 1982). The neo gene may be driven by the Rous 

35 Sarcoma Virus (RSV) promoter (region C 1 ) which is derived 
from the Ndel-Hindlll fragment from the plasmid pRSVcat 
(ATCC No. 37,152) (Gorman et al, PNAS , 79:6777-6781, 
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1982). In this example, no amplifiable marker need be 
used and thus there need be no region D in order to 
optimize the efficiency of the homologous recombination. 
The efficiency is inversely related to the proportion of 
non-homologous to homologous sequences present in the 
construct (Letsou et al, Genetics r 117:759-770, 1987). 
Region E, or the negative selection gene, may consist of 
the HSVtk gene which is a 2 kb Xho fragment obtained from 
the plasmid pMCITK plasmid (Capecchi et al, Nature . 
336:348-352, 1988). The HSVtk gene in that construct may 
be driven by the polyoma virus promoter and enhancer 
(region E») as constructed by Thomas et al ( Cell f 51:503- 
512, 1987). In a second DNA construct the polyoma promoter 
may be replaced by the RSV promoter described above. The 
DNA regulatory sequence used to activate the TSHfi gene may 
be either the RSV promoter or the rat growth hormone 
promoter. The rat growth hormone promoter consists of the 
SacI-EcoRI fragment obtained from the plasmid pRGH237CAT 
(Larson et al, PNA£, 83:8283-8287, 1986). The RSV promoter 
has the advantage of being usable in other cell lines 
besides GH cells, while the GH promoter is known to be 
active in GH cells and can be specifically induced (Brent 
et al » J- Biol, chem., 264:178- 182, 1989). The rat growth 
hormone promoter and the RSV promoter may be" inserted at 
location F in separate constructs. 

Following transfection of the above construct 
into a GH cell line, the cells may be grown in media that 
contains G418. This will allow only those cells which have 
integrated plasmid DNA into the genome either by homologous 
recombination or random integration to grow. The surviving 
cells may be grown in media that contains gancyclovir. The 
majority of the cells that survive this round of selection 
will be those in which the vector plasmid DNA is integrated 
via homologous recombination. These cells may be screened 
to demonstrate that they are producing xnRNA which 
corresponds t the TSH6 gene and that they are producing 
the TSHB protein. The genomic DNA may also be sequenced 
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around the area of insertion of the heterologous promoter 
to insure that the proper recombination event occurred. 

EXAMPLE - Activation of TSHB Gene in Rat Pituitary Cells 
5 Using the following protocol, thyrotropin beta 

subunit (TSHB) gene transcription, which normally does not 
occur in the rat GH 3 pituitary cell line, was activated in 
those cells by using the process of homologous 
recombination to target an activating element upstream of 

10 the TSHB coding region. The Rous Sarcoma Virus (RSV) 
promoter is known to function efficiently in GH3 cells 
(Christian Nelson et al, Nature . 322:557-562 (1986); Zheng- 
Sheng Ye et al, The Journal of Biological Chemistry , 
263:7821-7829 (1988)) and therefore was chosen as the 

15 activating element. A plasmid vector was constructed which 
contained the RSV activating element, portions of the 5' 
flanking region of the TSHB gene locus, and a selectable 
drug marker, aminoglycoside phosphotransferase gene (NEO) , 
for the isolation of transfected cell populations. 

20 Ribonucleic acid (RNA) was extracted from pooled drug 
resistant GH 3 cell populations and converted to 
complementary deoxyribonucleic acid (cDNA) . The cDNA was 
then screened by the technique of polymerase chain reaction 
(PCR) for the presence of TSHB cDNA. The constuction of 
25 the homologous recombination vectors and the control 
vectors is outlined below along with the experimental 
procedures and results, 

PLASMID CONSTRUCTION 
30 Homologous Recombination fHRV Backbone Vector 
' (PRSVCATNEO) . 

The Rous Sarcoma Virus (RSV) promoter was derived 
from the plasmid pRSVCAT (Cornelia M. Gorman et al., 
Proceedi ngs of the National Academy of Science. 79:6777- 
35 6781 (1982)) (figure 5) by isolating the 580 base pair (bp) 
Ndel - Hindlll fragment containing the functional promoter 
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unit. The ends of this fragment were blunted using DNA 
polymerase I Klenow fragment and Xbal linkers ligated to 
the blunt ends. After digestion with Xbal restriction 
endonuclease and gel purification, the resulting fragment 
5 was ligated into the Xbal site of pUC18 . A bacterial 
colony harboring a plasmid with the RSV insert in the 
orientation shown in figure 6 was designated pRSV. The 
aminoglycoside phosphotransferase gene (NEO) was cloned 
from pSV2NEO (P.J. Southern et al., Journal of Molecular- 

10 and Appl^d Genet jcs, 1:327-341 (1982)) by Isolating the 
Bglll and BamHI fragment (figure 7) and ligating that 
fragment into the BamHI site of pRSV (figure 6K A plasmid 
containing the NEO gene in the orientation shown in 
figure 8 was picked and designated pRSVNEOBAM. pRSVNEOBAM 

15 was digested with Smal and the 4328 bp fragment containing 
the RSV promoter region, the majority of the NEO gene and 
pUC18 was isolated by gel electrophoresis. The Smal ends 
of this fragment were Xhol linkered, cleaved with Xhol 
restriction enzyme and the plasmid recircularized by 

20 ligation. The resulting plasmid is shown in figure 9 and 
is called pRSVNEO. This last cloning step resulted in the 
deletion of a 786 bp fragment from the 3 1 end of the NEO 
fragment which is not necessary for its functional 
expression. This construction yields a plasmid in which 

25 the NEO gene is transcriptionally driven by the RSV 
promoter. 

Next the Ndel site located 5 r of the RSV promoter 
in pRSVCAT (figure 5) was converted to a Sail site. This 
was accomplished by digesting pRSVCAT with Ndel, filling in 

30 the ends using DNA polymerase I Klenow fragment and 

ligating Sail linkers to the resulting blunt ends. The 
linkers were digested to completion with Sail and the 
plasmid recircularized by ligation. Into the newly 
constructed Sail site was cloned the Sail - Xhol fragment 

35 from pRSVNEO (figure 9) containing th RSV promoter and the 
NEO. gene. A plasmid with the RSV promoter and NEO fragment 
oriented as shown in figure 10' was isolated and named 
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pRSVCATNEO. This plasmid when transfected into GH 3 cells 
was capable of conferring G418 resistance to those cells, 
demonstrating the ability of the RSV promoter to drive 
transcription of the NEO gene and the ability of that RNA 
to be translated into a functional protein (data not 
shown). Total RNA from the stable transfectants above was 
analyzed by polymerase chain reaction (PCR) to determine 
whether the CAT gene was being transcribed. PCR results 
showed that the CAT gene was indeed being transcribed in 
all the G418 resistant colonies tested (data not shown) , 
indicating that the RSV promoter 5' of the CAT gene was 
capable of driving transcription of a gene located 3 1 to 
it. This is important because this RSV promoter will be 
responsible for driving transcription of the TSHB gene when 
the TSHB HR vector described below integrates via 
homologous recombination into the GH 3 genome. 

TSHB HR Vector 

A vector capable of integrating into the GH 3 
20 genome by homologous recombination was created by inserting 
two stretches of the 5' flanking regions of the thyrotropin 
beta subunit (TSHB) gene into the unique Sail and Hindlll 
sites contained in pRSVCATNEO (figure 10). . A rat spleen 
genomic library containing inserts of 15 kilobases (kb) or 

25 greater cloned into lambda DASH was obtained from 

Stratagene, San Diego, CA. Using standard protocols 
( Current protocols in Molecular Biology , pp. 1,9.1 - 1.13.6, 
6.1.1 - 6.4.10) a 15.3 kb clone of the rat genomic TSHB 
gene including 9kb of sequence 5' of the first exon was 

30 isolated. The 15.3 kb fragment consisted of two Xbal 

fragments, a 10.6 kb fragment corresponding to the 5* end 
of the 15.3 kb fragment and a 4.7 kb piece corresponding to 
the 3 1 region of the 15.3 kb fragment (figure 11). Both of 
these Xbal fragments were subcloned into pUC18 and plasmids 

35 containing inserts in both orientations were isolated. The 
2.3 kb Xbal - Hindlll fragment c ntained in the 4.7 kb Xbal 
fragment (figure 11) was purified and the Xbal site of this 
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fragment was converted to a Hindlll site by filling in the 
ends with Klenow fragment and ligating on Hindlll linkers. 
This fragment was ligated into the unique Hindlll site 
contained in pRSVCATNEO (figure 10) . An isolate 
corresponding to a plasmid with the 2.3 kb insert in the 
correct orientation as shown in figure 12 was assigned the 
name pRSVCATNEOTSHB3 . 

The subcloned 10.6 kb Xbal fragment from the rat 
TSH6 clone (figure 11) was isolated and the Xbal ends 
converted to Sail sites by blunt ending the fragment with 
DNA polymerase I Klenow fragment and attaching Sail 
linkers. -This 10.6 kb Sail fragment was then cloned into 
the Sail site of pRSVCATNE0TSHB3 (figure 12) . A plasmid 
containing the insert in the correct orientation was 
identified and named pRSVCATNEOTSHB3-5XbaI (figure 13) . 
The latter plasmid has been deposited in the American Type 
Culture Collection, Rockville, MD, and has received 
depository number ATCC 40933. For the purpose of this 
deposit, the plasmid was renamed pHRTSH. This deposit was 
made in accordance with all of the requirements of the 
Budapest Treaty. 

CELL LINE 

GH 3 cells are a subcloned population' of MtT/W5 
which was derived from a radiation induced pituitary tumor 
in rats (B.K. Takemoto, Cancer Res^T-oh , 22:917 (1962)) 
and adapted to growth in culture by Tashjian et al., 
Endocrinology , 82:342-352 (1968). The GH 3 cells were 
obtained from the American Type culture Collection cell 
bank and are maintained in culture by growth in Dulbecco's 
Modified Eagle's Medium (DMEM) + 15% horse serum (HS) + 
2.5% fetal bovine serum (PBS) + i% L-glutamine (GH 3 media) 
at 37«C in 5% C0 2 . 
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DNA PREPARATION 

Larae-Scale Preparation of Plasmid DNA 

All plasmids used for stable transfections were 
purified by using the alkaline lysis method for large-scale 
5 plasmid DNA purif ication as described in Current Protocols 
in Molecular Biology , vol. 1, pp. 1.7.1 - 1.7.2* DNA 
isolated by the alkaline lysis method was further purified 
by double banding in a cesium chloride gradient as also 
described in Current Protocols in Molecular Biology , vol. 
10 1, pp. 1.7.5 - 1.7.7. 

Prior to transfection, the HR vectors were 
digested with either Aatll or Apal. Apal was used to 
linearize the control plasmid pRSVCATNEO and Aatll to 
linearize the HR plasmid pRSVCATNEOTSHB3-5XbaI. The 
15 location of the cleavage sites of Apal and Aatll can be 
seen in figures 10 and 13 respectively. After digestion 
with the appropriate restriction enzyme, the reaction was 
phenol/chloroform extracted, chloroform extracted, ethanol 
precipitated, and washed once with 70% ethanol. The 
20 plasmids were then resuspended in sterile deionized water 
(dH 2 6) to a concentration of 1 microgram/microliter (/xg/jul) 
as determined by absorbance at OD 260 . In an attempt to 
increase the transfection efficiency and/or the ratio of 
homologous recombination positives to those that" were due 
25 to random integration, pRSVCATNEOTSHB3-5XbaI was digested 
with Apal. Digestion with Apal cuts at three separate 
sites in pRSVCATNE0TSHB3-5XbaI and removes all regions of 
the vector except those necessary for homologous 
recombination (figure 13). After digestion with Apal, the 
30 reaction was electrophoresed on a 0.8% agarose gel and the 
top band corresponding to the 10,992 bp fragment containing 
the two 5' flanking regions of the TSHfi gene, the RSV 
promoter - NEO region and the TSH5 gene-activating RSV 
promoter was isolated from the gel by electroelution into 
35 dialysis tubing. The electroeluted DNA was further 

purified by using an elutip minicolumn (Schleicher and 
Schuell) with the manufacturer's recommended standard 
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protocol. The DNA was eluted from the column, ethanol 
precipitated, washed with 70% ethanol and resuspended to a 
concentration of 1 nq/pl. 

5 STABLE TRANSFECTIONS 

Calcium Phosphate Transf eetinn 

48 hours prior to transfection 3 x io 6 GH 3 cells 
were plated on 10 centimeter (cm) dishes. For each dish, 
10 ng of vector DNA along with 30 ng of sonicated salmon 
10 sperm DNA was added to 0.5 milliliters (ml) of tranfection 
buffer. The transfection buffer was prepared by combining 
4g NaCl, 0.185g Kcl, 0.05g Na 2 HP0 4 , 0.5g dextro.se, 2.5g 
HEPES and dH z O to a final volume of 500 ml and bringing the 
pH to 7.5. 31 /xl of 2 molar (M) CaCl 2 was added to the 0.5 
15 ml of DNA + transfection buffer and vortexed. This 

solution was allowed to stand at room temperature for 45 
minutes. When the DNA - CaCl 2 - transfection buffer was 
ready, the GH 3 medium was removed from the 6H 3 cells and 
the DNA - CaCl 2 - transfection buffer was layered over the 
20 cells. The cells were allowed to stand at room temperature 
for 20 minutes. After 20 minutes, 5 ml of GH 3 medium was 
added and the plates were incubated at 37°C for 6 hours. 
The cells were then shocked by aspirating off the medium 
and adding 5 ml of fresh transfection buffer containing 15% 
25 glycerol for 3.5 minutes. The cells were rinsed 2x with 
PBS and fed with 10 ml of GH 3 medium. 48 hours post- 
transfection, the medium was removed and 10 ml of GH 3 
medium containing 400 fig /ml G418 was added. 

30 Electroporation 

Electroporation was carried out using a BTX 300 
Transf ector with 3.5 millimeter (mm) gap electrodes, l x 
10 7 GH 3 cells growing in log phase were removed from their 
plates by trypsinization, pelleted by centrifugation and 

35 washed once with PBS. Cells were resuspended in 1.0 ml of 
PBS and transferred to 2.9 ml Ultra-uv disposable cuvettes 
(American Scientific Pr ducts) on ice. io fig of DNA was 
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added to the cells, mixed and placed back on ice for 5 
minutes. After 5 minutes the electrodes were placed in the 
chamber and the cells were electroporated at a setting of 
750 microfarads with a 200 volt pulse. The cuvette was 
5 then returned to ice for 10 minutes. Cells were 
transferred from the cuvette to 9 ml of GH 3 medium 
containing 1% penicillin and 1% streptomycin at room 
temperature in a 15 ml conical tube and allowed to stand 
for 10 minutes. The total electroporation of 1 x 10 7 cells 
10 was transferred to three 10 cm plates giving approximately 
3 x 10 6 cells per plate. After 48 hours, the GH 3 medium 
containing 400 /xg/ml G418 was added. 



15 



Transf ection of GH 3 cells with pRSVCATNE0TSHB3-5Xbal (Aatll 
cut) . P RSVCATNEOTSHB3-5XbaI (Apal cut) and dRSVCATNEO (Anal 
cut) 



pRSVCATNE0TSHB3-5XbaI (Aatll cut), 
pRSVCATNEOTSHB3r-5XbaI (Apal cut) and pRSVCATNEO (Apal cut) 
plasmids were transf ected into GH 3 cells along with a no 
DNA control using both the calcium phosphate protocol and 

20 the electroporation protocol. 48 hours after transf ection, 
the cells were put under G418 selection. Approximately 14 
to 21 days later the colonies became visible by eye on the 
10 cm dishes and were counted. In all of the no DNA 
controls, there were no visible colonies, demonstrating 

25 that the G418 selection was working and that the presence 
of a plasmid containing the RSV - NEO region was necessary 
to confer G4 18 resistance. At this time, colonies were 
picked and pooled by isolating regions on the 10 cm dish 
with 17 millimeter wide cloning rings. These large cloning 

30 rings encompassed between 10 and 70 colonies depending on 
the density of the colonies per plate and allowed the GH 3 
cells in that isolated region to be removed and pooled at 
the same time by trypsinatidn. The trypsinized colonies in 
each ring were transferred to 6 well plates and allowed to 

35 grow in GH 3 media containing G418. After reaching 70% to 
80% confluence, 80,000 cells were transferred to a 24 well 
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plate and the remaining cells cryopreserved for further 
testing at a later date. The cells in the 24 well plates 
were grown until they reached 50% to 80% confluence. Total 
RNA was then harvested from these GH 3 cells by the 
following procedure. 

RNA ISOLATION FROM TRANSFECTED GH 3 CELLS GROWN IN 24 WELL 
PLATES 

The following is a modification of the protocol 
described by Chomczynski and Sacchi, Anal. Blnnhm., 
162:156-159 (1987). The media covering the GH 3 cells in 
the 24 well plates was removed and the cells washed with l 
ml of PBS. l ml of GTC solution was added and the cells 
were incubated at room temperature for 5 minutes. GTC 
15 solution was prepared by dissolving 250 g of guanidium 
thiocyanate (Fluka) in 293 ml of dH 2 0, and then adding 
17.6 ml of 0.75 M Na citrate pH 7.0 and 26.4 ml of 10% 
sarcosyl (L-Lauryl sarcosine) . Just prior to use, 360 M i 
of B-mercaptoethanol per 50 ml GTC solution was added. 
20 After 5 minutes at room temperature, the 1 ml of GTC-cell 
lysate was transferred to a Sarstedt 55.518 snap-cap tube 
containing 2 ml of GTC solution. To each tube was added 
300 ixl of 2M sodium acetate pH 4.0 and the tube vortexed. 
Next, 3 ml of dH 2 0 saturated phenol was added and the tubes 
25 were vortexed again. To each tube was added 600 jttl of 

chloroform: isoamyl alcohol (49:1) and the tube was shaken 
by hand for 10 seconds and placed on ice for 15 minutes. 
The tubes were then centrifuged' in a Sorval RC-5B using a 
SM24 rotor at 8000 revolutions per minute (RPM) for 20 
30 minutes at 4»C. The aqueous phase was transferred to a 
fresh Sarstedt tube containing 3 ml of isopropanol and 
placed at -20'C for l hour. After l hour the tubes were 
spun in a Sorval RC-5B using a SM24 rotor at 8000 rpm for 
20 minutes at 4-C. The supernatants were removed and the 
pellets resuspended in 500 nl of GTC solution. The 
resuspended RNA was transferred to a 1.5 ml eppendorf tube 
to which 500 Ml of isopropanol was added. The tubes were 
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once again placed at -20 °C for l hour. The eppendorf tubes 
were spun for 5 minutes in a microf uge 'and the supernatant 
discarded. The pellet was washed with 70% ethahol 2 times 
and allowed to dry until the ethanol had completely 
5 evaporated. The pellet was resuspended in 20 fxl of diethyl 
pyrocarbonate (depc) treated water and heated to 65 °C for 5 
minutes. This RNA was then used to make cDNA in one of the 
two procedures described below. 

10 CDNA REACTIONS 
Method 1 

First strand cDNA was synthesized from 2.5-6.0 
microliters of total RNA (approximately 0.5-6 micrograms) 
in a reaction volume of 10-20 microliters. The total RNA 
15 was obtained by the extraction method described above, and 
was denatured for 5-10 minutes at 70 °C and quick chilled on 
ice before adding the reaction components. The reaction 
conditions were 50 millimolar (mM) Tris-HCl (pH 8.3), 10 mM 
MgCl 2 , 10 mM DTT, 0.5 mM each of dCTP, dATP f dGTP, and dTTP 
20 (Pharmacia), 40 mM KC1, 500 units/ml RNasin (Promega 

Biotech), 85 jzg/ml oligo(dT) t 2 . , 8 (Collaborative Research, 
Inc.), and 15,000-20,000 units/ml Moloney murine leukemia 
virus reverse transcriptase (Bethesda Research 
Laboratories) incubated at 37 °C for 60 minutes. The 
25 reaction was terminated by the addition of EDTA to 40 mM, 
and the nucleic acid was precipitated by adding sodium 
acetate to a concentration of 0.3 M and two volumes of 
ethanol. The precipitate was allowed to form at 0°C for 30 
minutes and was pelleted by centrifugation in a microf uge 
30 at 14,000 rpm for thirty minutes. The pellet was washed 
with 70% ethanol, dried, and resuspended in depc treated 
water to a volume of 15-25 microliters. 

Method 2 

35 Conditions for first strand synthesis of cDNA 

from RNA were adapted fr m Carol A. Brenner et al, 
BioTechnicrues. Vol. 7, N . 10, pp. 1096-1103 (1989). 1 /il 
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of total RNA from the RNA prep procedure described above 
was added to 9 /il of reaction buffer in a 0.5 ml eppendorf 
tube. The reaction buffer consisted of 200 units of 
Moloney murine leukemia virus reverse transcriptase (MMLVRT 
5 Bethesda Resesarch Labs) , and a final concentration of the 
following reagents: 70 mM Tris.HCl pH 8,8, 40 mM KC1, 0.1% 
Triton X-100, 1 mM of each dNTP, 4 mM MgCl 2 , and 0.45 OD 260 
units of random hexamers (Pharmacia) . After mixing, the 
tubes were incubated at room temperature for 10 minutes and 
10 then placed at 42 °C for 1 hour. After 1 hour the tubes 

were heated to 90 °C for 1 minute to deactivate the MMLVRT 
"and then cooled to room temperature. 

POLYMERASE CHAIN REACTION (PCR) AMPLIFICATION OF RNA FROM 
15 GH 3 CELLS 

The following primers were used to amplify, by 
PCR, TSHfi cDNA synthesized from RNA transcripts produced by 
the GH 3 cells as a result of the HR plasmids activating the 
endogenous TSH6 gene by homologous recombination. 

20 primer 5 1 3 1 

TSH65 AGTATATGATGTACGTGGACAGG 
TSH03 CACTTGCCACACTTGCAGCTCAGG 

Figure 14 shows the regions of the TSHfl gene to 
25 which each primer corresponds. 

PCR REACTION CONDITIONS 

All PCR reactions were performed in the Ericomp 
Twinblock thermocycler. If PCR amplification was to be run 

30 on cDNA made by method 2, 40 pi of additional reaction mix 
was directly added to the 10 ftl of the cDNA reaction 
bringing the total volume up to 50 pi. The final 
concentrations of reagents in the 50 pi were 70 mM Tris.HCl 
pH 8.8, 40 mM KC1, 0.1% Triton X- 100, 2.25 units Taq 

35 polymerase (Pharmacia), 0.2 micromolar (/iM) each primer, 
200 each dNTP, and 0.8 mM MgCl 2 . 
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If PCR was to be performed on cDNA made by method 
1 above, 5 to 10 fxl of the resuspended cDNA was added to 40 
to 45 Ml containing filial concentrations of the following: 
70 mM Tris.HCl pH 8.8, 40 mM KC1, 0.1% Triton X-100, 2.25 
5 units Tag polymerase, 0.2 /*M each primer, 200 fM each dNTP, 
and 0.8 mM MgClj . 

The reactions were then subjected to the 
following PGR cycles. 

1 minute at 94 °C. 
10 30 seconds at 55 °C. 

2 minutes at 72°C. 

The above cycle was repeated 30* to 40 times. 10 
/xl of each reaction mix was run on a 6% polyacrylamide gel 
and screened for the presence of a 247 bp PCR fragment 
15 which would indicate the presence of the properly spliced 
mRNA for TSHB. 

PCR RESULTS FOR AMPLIFICATION OF TSHB RNA FROM GH 3 CELLS 
AND RAT PITUITARY GLAND TOTAL RNA 

20 To determine whether GH 3 cells normally 

synthesize TSHB RNA, cDNA from untransf ected GH 3 cells as 
well as cDNA from rat pituitary glands was subjected to the 
above PCR reaction conditions. The correct 247 bp band 
indicative of the presence of TSHB mRNA was visible in the 

25 positive control of the rat pituitary gland sample but no 
band was visualized from the GH 3 cell total RNA sample even 
after 60 cycles (data not shown) . 

TRANSFECTION RESULTS 
30 The number of G418 resistant colonies present on 

the 10 cm dishes were tabulated between 14 and 21 days 
after addition of G418 to the media. 
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Transf ec tion Method Colonies per 10 cm dish 

PRSVCATNEO PRSVCATNEOTSHB3 -5XBA1 
Apal cut Aat2 cut 
5 Calcium phosphate 1 48 13 29 

Calcium phosphate 2 — 21 58 

Electroporation 1 — 1295 415 

Electroporation 2 — 1051 723 

10 Total RNA was harvested from the colony pools 

contained in the 24 well plates as described above. cDNA 
was made from these RNA preps and subjected to PCR 
amplification. The number of positive colonies producing 
TSHB mRNA was determined by the presence of a 247 bp 
15 fragment as visualized on a polyacrylamide gel. Each of 
the pools screened contained between 10 and 70 colonies. 
The estimated number of colonies per pool per transf ection 
was used to approximate the number of G418 resistant GH 3 
cell clones in which TSHB gene transcription was activated. 
If a pool tested positive, it was assumed that this 
represented one positive colony present in that particular 
pool . 

P 1 *™** G418 resistant colonies TSHft RNA positi yg 

25 pRSVCATNEO 60 0 

pRSVCATNEOTSHB3 -5XBA1 4942 3 

(Aat2 digested) 

PRSVCATNEOTSHB3 -5XBA1 8580 6 

(Apal digested) 

30 These results demonstrate the successful 

activation of the normally transcriptionally silent TSHB 
gene by the method of the present invention. While the 
number of colonies that are positive for TSHB transcription 
is small compared to the number of colonies that are G418 

35 resistant (approximately one out of every 10 3 G418 
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resistant colonies) , this result is generally consistent 
with rates reported for other homologous recombination 
experiments (Michael Kriegler, Gene Transfer and Expression 
A Laboratory Manual . Stockton Press, New York, NY (1990), 
5 pp. 56 - 60)*. It has been generally observed that the 
homologous recombination rate seems to be proportional to 
the rate of transcription of the targeted gene (M. Frohman 
and G. Martin, Cell . 56:145 (1989); L. Mansour et al, 
Nature, 336:348 (1988)). It should be noted that the rate 
10 which has been demonstrated is three orders of magnitude 
higher than what might be expected for random mutation 
turning oh the TSHB gene. 

To ensure that the results for each colony pool 
were reproducible and that the activation of RNA 
15 transcription was stable, colony pools previously frozen 
away corresponding to pools which tested positive in the 
first screening were thawed and expanded in culture. The 
freshly thawed GH 3 positive pools were seeded in T 25 
tissue culture flasks and expanded until the cells reached 
20 70% to 80% confluence. 80,000 cells were then plated in 24 
well plates from each flask and grown until they were 50% 
to 70% confluent. RNA was extracted from the cells, 
converted into cDNA, and screened once again for the 
presence of TSHB RNA by running 10 til of each PCR reaction 
25 on a 6% polyacrylamide gel. Figure 15 shows the results 
of representative PCR reactions from the second screening 
as visualized on a polyacrylamide gel by ethidium bromide 
staining and fluorescence. Lanes 1, 2, and 3 contain the 
PCR reactions run on cDNA from GH 3 cells which had been 
30 trarisfected by pRSVCATNEO. pRSVCATNEO contains no regions 
of homology to TSHB and thus is not capable of activating 
the TSHB gene by homologous recombination. As can be seen 
on the gel in figure 15, there are no bands corresponding 
to 247 bp in those lanes indicating that the TSHB gene is 
35 not activated. Lane 6 also contains a negative c ntrol. 
In that lane three pools were c mbined from samples f GH 3 
cells which had been transfected with pRSVCATNEOTSHB3-5XbaI 
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(Apal cut) but which were negative for transcription of the 
TSH5 gene on the first screening. The "absence of the 247 
bp fragment in lane 6 demonstrates that the presence of the 
transfected pRSVCATNEOTSHB3-5XbaI (Apal cut) plasmid 
integrated randomly in the genome is not capable of 
producing the 247 bp TSHfi PCR fragment. Lanes 7, 8, 9, and 
10 include PCR reactions run on cDNA made from total RNA 
harvested from rat pituitary glands in quantities per 
reaction of 25 nanograms, 100 nanograms, 200 nanograms, and 
400 nanograms, respectively. The presence in these lanes 
of the expected 247 bp band, produced from cDNA prepared 
from a rat tissue which normally expresses TSHfi, showed 
that the PCR reaction conditions were correctly 
optimized and that the PCR band obtained in lanes 4 and 5 
containing the homologous recombination TSHfi positives is 
of the correct size. Two pools transfected with 
pRSVCATNEOTSHB3-5XbaI (Apal cut) which were positive in the 
first screening, ApaI-107 in lane 4 and Apal-136 in lane 5, 
once again tested positive for TSHfi gene activation as 
demonstrated by the presence of the correct TSHfi PCR band 
amplified from cDNA made from the total RNA extracts from 
those pools proving that transcription of TSHfi gene has 
been stably activated. The presence of bands at 247 bp in 
" lanes 4 and 5 containing RNA from previous positives Apal- 
107 and Apal-136 and the absence of bands in the negative 
controls of pRSVCATNEO transfected GH 3 cells in lanes 1-3 
and the pRSVCATNEOTSHB3-5XbaI (Apal cut) negatives in lane 
6 demonstrated that the production of TSHfi RNA in a cell 
line that does not normally produce that RNA has been 
stably turned on by homologous recombination. 

The present invention is not limited to the cell 
line that is described herein. All cell lines have genetic 
information which is normally silent or inert. Most are 
able to express only certain genes. However, a normally 
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transcriptionally silent or inert gene of any such cell 
line can be activated to express the gene product in 
accordance with the present invention and any gene of the 
genome may have its expression characteristics modified in 
5 accordance with the present invention. Even previously 
transformed cell lines can be used as long as the previous 
transformation did not disrupt the gene of interest. The 
source of the cell line is not important. The cell line 
may be animal or plant, primary, continuous or immortal. 
10 Of course, it is desirable that any such cell line be 
stable and immortal so that after treatment with the 
technique in accordance with the present invention, 
expression can be commercialized. Cloned microorganisms, 
whether prokaryotic or eukaryotic, may also be treated by 
15 the technique of the present invention. 

While the present invention has been preferably 
described with respect to the expression of a normally 
transcriptionally silent or inert gene, the technique of 
the present invention is also applicable to the 
20 modification of the expression characteristics of a gene 
which is naturally expressed in the host cell line. For 
example, if it is desired to render the expression of a 
gene dependent upon culture conditions or the like so that 
expression can be turned on and off at will, an appropriate 
25 DNA regulatory segment, such as a regulatable promoter, can 
be inserted which imparts such characteristics, such as 
repressibility or inducibility. For example, if it is 
known that the cell type contains nuclear steroid 
receptors, such as estrogen, testosterone or 
30 glucocorticoid, or thyroxin receptors, one could use the 
steroid or thyroxin response elements as region F. Such a 
response element is any DNA which binds such receptor to 
elicit a positive response relative to transcription. Even 
if a cell is not naturally responsive to glucocorticoids, 
35 for example, a piece of DNA which encodes the 

glucocorticoid receptor c uld be added to the construct, or 
otherwis inserted somewhere in the genome, so as to make 
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the cell responsive to glucocorticoids. The use of a 
regulatable promoter could be desirable whether or not the 
gene of interest is normally transcriptionally silent. 
Other kinds of regulation can also be obtained by targeting 
5 the appropriate DNA regulatory segment to the exact 

position of interest by means of the process of the present 
invention. 

Thus, while stimulation of expression of normally 
transcriptionally silent genes is the preferred application 
10 of the present invention, in its broadest sense it is 
applicable to the modification of expression 
characteristics of any gene endogenous to the host cell 
line. 

The specific technique of homologous 
15 recombination is not, per se, a novel part of the present 
invention. Such techniques are known and those of ordinary 
skill in this art will understand that any such technique 
can be used in the present invention as long as it permits 
targeting of the DNA regulatory sequence to the desired 
20 location with respect to the gene of interest. While a 
preferred technique is disclosed, using a linearized 
construct with two homologous regions on either end of the 
sequences to be inserted, any other technique which will 
accomplish this function, as, for example, by using 
25 circular constructs, is also intended to be comprehended by 
the present invention. The critical feature of the present 
invention is the use of homologous recombination techniques 
to insert a DNA regulatory sequence which causes 

30 
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modification of expression characteristics in the cell line 
or microorganism being used, operatively linked with a gene 
in the genome of the cell line, preferably one which is 
normally transcriptionally silent, or to insert an 
5 amplifiable sequence, without a regulatory sequence, 

sufficiently near a gene in the genome of the cell line 
which already transcribes as to cause amplification of such 
gene upon amplification of the amplifiable sequence. It is 
not absolutely necessary that a selectable marker also be 
10 included. Selection can be based solely on detection of 

the gene product of interest or mRNAs in the media or cells 
following insertion of the DNA construct. Furthermore, in 
the embodiment in which a regulatory sequence is being 
inserted, amplification, while desired, is not critical for 
15 operability. The same is true for the negative selection 
gene which makes the screening process easier, but is not 
critical for the success of the invention. Thus, the basic 
embodiment requires only insertion of the DNA regulatory 
segment or the amplifiable segment in the specific position 
20 desired. However, the addition of positive and/ or negative 
selectable marker genes for use in the selection technique 
is preferred, as is the addition of an amplifiable gene in 
the embodiment in which a regulatory segment is being 
added. 

25 The term "modification of expression" as used 

throughout the present specif ication and claims, is hereby 
defined as excluding termination of expression by inserting 
by homologous recombination a mutjation, deletion, stop 
codon, or other nucleotide sequence, including an entire 

30 gene, into the gene of interest, so as to prevent the 

product of interest from being expressed. The prior art 
teaches the use of homologous recombination to insert 
specific mutations and the expression of a cell product may 
have inherently been terminated by means thereof (see, for 

35 example, Schwartzberg et al, PNAS (USA) , 87:3210-3214 
(1990)). The present invention is not intended to 
encompass such a procedure. In the present invention the 
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"m dif ication of expression" is acc mplished by means of 
inserting regulatory and/or amplification regions at a 
specific desired location by means of homologous 
recombination. The preferred modifications are those which 
5 activate and/ or enhance expression of the product of 
interest. 

Whenever the present specification uses the 
phrase that a DNA regulatory segment is "operatively linked 
with" a gene, such terminology is intended to mean that the 

10 DNA regulatory segment is so disposed with respect to the 
gene of interest that transcription of such gene is 
regulated by that DNA regulatory segment. The regulatory 
segment is preferably upstream of the gene, but may be 
downstream or within the gene, provided that it operates to 

15 regulate expression of the gene in some way. The DNA 

regulatory segment may be a promoter, terminator, operator, 
enhancer, silencer, attenuator, or the like, or any 
combination thereof. 

Whenever the terms "upstream" or "downstream" are 

20 used in the present specification and claims, this is 

intended to mean. in the 5 '-direction or the 3 '-direction, 
respectively, relative to the coding strand of the gene of 
interest. 

The foregoing description of the specific 
25 embodiments so fully reveals the general nature of the 

invention that others can readily modify and/or adapt such 
specific embodiments for various applications without 
departing from the generic concept. Any such adaptations 
and modifications are intended to be embraced within the 
30 meaning and range of equivalents of the disclosed 

embodiments. It is to be understood that the phraseology 
and terminology employed herein are for the purpose of 
description and not of limitation. 
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WHAT IS CLAIMED IS: 

1. A method of activating a normally 
transcriptionally silent gene within the genome of a cell 
line or microorganism so as to enable said cell line or 

5 microorganism to express the gene product of said gene, 
comprising inserting a ONA construct into said genome by 
homologous recombination, said DNA construct comprising a 
ONA regulatory segment capable of stimulating expression of 
said gene when operative ly linked thereto and a DNA 
10 targeting segment homologous to a region of said genome 

within or proximal to said gene, wherein said construct is 
inserted such that said regulatory segment is operatively 
linked to said gene of interest. 

2. A method in accordance with claim 1, 21, or 
15 22, wherein said DNA construct comprises two DNA targeting 

segments, each homologous to a region of said genome within 
or proximate to said gene, one of said targeting segments 
being upstream of said regulatory segment and the other of 
said targeting segments being downstream of said regulatory 
20 segment. 

3. A method in accordance with claim 1, 2, or 
21, wherein said DNA construct additionally comprises at 
least one expressible selectable marker gene disposed so as 
to be inserted with said regulatory segment. 

25 4. A method in accordance with claim 1, 2, 3, 

21, 22, or 23, wherein said DNA construct additionally 
comprises a negative selectable marker gene disposed with 
respect to said targeting segment so as not to be inserted 
when said construct is properly inserted by homologous 

30 recombination, whereby said negative selectable marker is 
not expressed in cells in which said DNA construct is 
properly inserted. 

5. A method in accordance with claim 1, 2, 3, 4, 
or 21, wherein said DNA construct additionally comprises an 

35 expressible amplifiable gene disposed so as to be inserted 
with said regulatory segment. 
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6. A method }n accordance with claim 1, 2, 3, 4, 
5, 21 , 22, or 23, wherein said cell line or microorganism 
is a eukaryotic cell line. 

7. A method in accordance with claim 6, wherein 
5 said cell line or microorganism is an animal cell line. 

8: A method in accordance with claim 6, wherein 
said cell line or microorganism is a mammalian cell line. 

9. A method in accordance with claim 6, wherein 
said cell line or microorganism is a plant cell line. 
10 10. A method in accordance with claim 3, and 

additionally for causing expression of said gene product, 
further including the steps of,, following said inserting 
step: 

selecting clones of said cell line or 
15 microorganism which express the product of said selectable 
marker gene; 

cultivating the selected clones under conditions 
sufficient to permit expression of said gene product; and 

collecting said gene product. 
20 il» A method in accordance with claim 10, 

wherein said selectable marker gene is the neomycin 
resistance gene and said selecting step comprises selecting 
those clones having neomycin resistance. 

12. A method in accordance with claim 10 or 11, 
25 wherein said DNA construct additionally comprises a 

negative selectable marker gene disposed with respect to 
said targeting segment so as not to be inserted when said 
construct is properly inserted by homologous recombination, 
whereby said negative selectable marker is not expressed in 
30 cells in which said DNA construct is properly inserted, and 
said selecting step further includes selecting those clones 
which do not express said negative selectable marker gene. 

13. A method in accordance with claim 12, 
wherein said negative selectable marker gene is the Herpes 

35 Simplex Virus thymidine kinase gene and s,aid selecting step 
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includes selecting those clones which survive exposure to a 
media that kills cells which express said gene. 

14. A genome having a DNA regulatory segment 
operatively linked with a naturally occurring gene at an 

5 insertion site characterized by a predetermined DNA 

sequence , said DNA regulatory segment not being naturally 
occurring at said location in the genome. 

15. A cell line or microorganism capable of 
expressing q gene product by a normally transcriptionally 

10 silent gene within the genome of said cell line or 

microorganism , said genome having inserted therein a DNA 
regulatory segment operatively linked with said normally 
transcriptionally silent gene , said DNA regulatory segment 
being capable of promoting the expression of a gene product 

15 by said cell line or microorganism. 

16. A cell line or microorganism in accordance 
with claim \5 or 25, wherein said DNA regulatory segment is 
one which is capable of promoting the expression of a gene 
product normally expressed by said cell line or 

20 microorganism. 

17. A cell line or microorganism in accordance 
with claim 16, wherein the inserted DNA regulatory segment 
is part of a DNA construct comprising said DNA regulatory 
segment and at least one selectable marker gene. 

25 18. A cell line or microorganism in accordance 

with claim 17, wherein said DNA construct additionally 
comprises an amplifiable gene. 

19. A method of obtaining a gene product from a 
cell line or microorganism, comprising culturing a 

30 differentiated cell line or microorganism in accordance 
with claim 15-18 or 24-26 under conditions which permit 
expression of said gene product, and collecting said gene 
product. 
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20. A DNA construct for insertion into a 
predetermined host cell line or microorganism, comprising a 
DNA regulatory segment capable of modifying the expression 
characteristics of genes in the host cell line or 

5 microorganism when operatively linked thereto and a DNA 

targeting segment homologous to a region of the genome of a 
preselected gene within the host cell line or 
microorganism . 

21. A method of modifying the expression 

10 characteristics of a gene within the genome of a cell line 
or microorganism, comprising inserting a DNA construct into 
said genome by homologous recombination, said DNA construct 
comprising a DNA regulatory segment capable of modifying 
the expression characteristics of said gene when 

15 operatively linked thereto, as compared to its existing DNA 
regulatory segment, and a DNA targeting segment homologous 
to a region of said genome within or proximal to said gene, 

wherein said construct is inserted such that said 
regulatory segment is operatively linked to said gene of 

20 interest. 

22. A method of modifying the expression 
characteristics of a gene within the genome of a cell line 
or microorganism, comprising inserting a DNA construct into 
said genome by homologous recombination, said DNA construct 

25 comprising an expressible, amplifiable gene capable of 

amplifying said gene when inserted in sufficiently close 
proximity thereto, and a DNA targeting segment homologous 
to a region of said genome within or proximal to said gene, 
wherein said construct is inserted such that said 

30 amplifiable gene is in sufficiently close proximity to said 
gene of interest to cause amplification thereof when said 
amplifiable gene is amplified. 

23. A method in accordance with claim 22, 
wherein said DNA construct additionally comprises at least 

35 one expressible selectable marker gene disposed so as to be 
inserted with said expressible, amplifiable gene. 
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24. A cell line or microorganism capable of 
enhanced expression of a gene product compared to the cell 
line or microorganism from which it is derived, said gene 
product beipg the expression product of an endogenous gene 
5 within the genome of said cell, said genome having inserted 
therein in an operative manner, at or near said endogenous 
gene, an exogenous DNA regulatory segment and/ or 
amplifiable gene capable of enhancing the expression of 
said gene product by said cell line or microorganism. 
10 25. A cell line or microorganism in accordance 

with claim 24 , wherein said exogenous DNA regulatory 
segment and/or amplifiable gene is an exogenous DNA 
regulatory segment. 

26. A cell line or microorganism in accordance 
15 with claim 24, wherein said exogenous DNA regulatory 

segment and/or amplifiable gene is an exogenous amplifiable 
gene. 

27. A DNA construct for insertion into a 
predetermined host cell line or microorganism, comprising 

20 an expressible, amplifiable gene capable of amplifying a 
gene in the host cell line or microorganism when inserted 
in sufficiently close proximity thereto, and a DNA 
targeting segment homologous to a region of the genome of a 
preselected gene within the host cell line or 

25 microorganism. 

28. A method in accordance with claims 1, 2, 3, 
4, 5, 21, 22, or 23, wherein said cell line or 
microorganism is a microorganism. 

30 



35 



WO 91/09955 



PCI7US90/07642 



1/14 

FI G . I 




SUBSTITUTE SHEET 



WO 91/09955 



PCI7US90/07642 



2/14 

FIG . 2 A 



RANDOM INTEGRATION 




RANDOM INTEGRATION 



emmm p 



D' 



C 



FIG . 2B 



HOMOLOGOUS RECOMBINATION 




HOMOLOGOUS RECOMBINATION 



W//W/////M 



TRANSCRIPTION 



SUBSTITUTE SHEET 



WO 91/09955 



PCI7US90/07642 



3 / 14 

FIG. 3 

HOMOLOGOUS RECOMBINATION CONSTRUCT FOR RAT 

: TSH BETA 



E' 



■Mff/A t' C 



FIG. 4 




B 



GENE OF INTEREST ( 



HOMOLOGOUS RECOMBINATION 



GENE OF INTEREST / 



B G' C 



B 



SUBSTITUTE SHEET 



WO 91/09955 



PCI7US90/07642 



4 / 14 

FIG. 5 



Hind 3 (5025) 




Aat2 (2567) 



ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



PCT/US90/07642 



5/14 

F I G . 6 




* SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



WO 91/09955 



PCT/US90/07642 



6 / 1 i 4 

FIG. 7 



Bgl 2(5229) 




Apa1 (3112) . Aat2(2561) 



ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



WO 91/09955 



^ PCT/US90/07642 



7/14 



FIG. 8 




"(I006) 



*SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBStrrUt&SHBET 



WO 91/09955 



PCT/US90/07642 



8/14 



FIG. 9 



Act 1 1(4262) 




Hind II I (400) 

Sphl(406) 

Pstl(412) 
Soll(418) 

NdeT/Xba 1(424) 



Hind37Xba1(1002) 
Bgl2VBamHT(1006) 



SmalVXho 1(2080) 
Kpnl(2084) 
Sacl ( 2090) 

EcoRI(2096) 



* SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



WO 91/09955 



PCT/US90/07642 



9/14 



polyA(5141) 
BomH1(5028) 

Apa 1(47^3) 



FIG. 10 



Hind 3 (6660) 




Sma1 /*Xho1*/Sal1"(578) 



Bgl2*/BdmH1 
"(1640) 



Sail (2213) 



Aat2(4202) 



* SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



PCI7US90/07642 



10 / 14 



FIG. II 



RAT TSH BETA 



Xbal Xbal Hind3 Xbal 



23 



2255 bp 

H. 



10600 bp 

' 4700 bp 

I5300bp 



SUBSTtTiJtBrSHEET 



WO 91/09955 



PCI7US90/07642 



11 / 14 



FIG. 12 



Xb'plV.Hind3(8916) 

Smar/XholV Salf(578) 

p\ Bgl27BamHT(1640) 



Apa1(6717L_ 



Hind 3(6662) w 




Sal 1(2213) 



poly A (5141 ) 
BamH1 (5028) 



Apa 1(4753) Aat2(4202) 



• SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



WO 91/09955 



PCT/US90/07642 



12/14 



F I G . 13 



Apol (17331) 
Hind 3(17281) 



pol y A ( 15760) 
BamH1(15647) 

ApaK 15372) 
Aat2(14821) 



Sal 1(12832) 



Xba1VHind3(19535) 

Srna1*/Xho1VSair(5 78) 
Bgl2VBamH1»(1640) 

Sail (2215) 
BamH1(2220) 

Hind 3 (2681) 




Hind 3(7093) 



Hind3( 11253) 



Apa1 (8943) 



* SITE NO LONGER EXISTS 
ARROW INDICATES SENSE DIRECTION 



SUBSTITUTE SHEET 



WO 91/09955 



PCI7US90/07642 



FIG. 14 13/14 

LOCATION OF PRIMERS FOR PCR AMPLIFICATION OF TSH BETA 

5' 

g gc.a cgcctct go a tg t gg aa a g gacactt at gogctctgtggtctttccctctgattt 
a g CATGAATGCTGTCGTTCTCTTTTCCGTGCTTTTCGCT^ 

5' TSHB5 3" 

A G T ATATGATGTACGTGGACAGG 
C AT C GTTTTGTAT TCCCACTGAGT ATATGV\TGT ACGTGGACAGGAGAGAGTGTGCCTAC 

** ********** ************************* 

TGCCTG ACCATCAACACCACC ATCT GCGCTGGGTATTGT ATGACACGG gtatgttggt 

* ***** ************ * ** *************************** 

cact gcgtttct 1 1 tagct gt o go t tgfocoggf ctoaogtt gtctgttaatattttag 
a a aggaagtggga t aaa t c at a g t c t c ctctt tgggaagccaagcacactgctttcaga 

a 1 1 a tgattatg t cat tc t a cac ag aaaaagta cagatacat t gtaacagtttacccta 
aag t gt t tgt t ct gctcaa t gg t ag a tgagaagaaagtgtccttttttgtctctgaggg 

g t t aog tgtagat g tgtggg to a c a gagctcoggagtcctttaagatcatcaggaaaca 
a a gg go tot.tqgtc a ttc t a 1 1 a c ac taagttg catgcagtttatcatgttaagatctc 

t 1 1 1 c t tccacag GATATCAATG GCAAACTGTTTCTTCCCAAGTACGCACTCTUTCAG 
* ** ****** * ** ******** ************************* 

G ATG TCTGT ACATA C AG AG A C T TC ACCTACASAACGGTCGAMTACCGGGATGOXACA 

* *X***X*****X*X********* *** ******************************* 

C C ATGTTGCTCCTT ATTTC TC CT A C CCCGTTGCCCTGAGCTG(^GTGTGGCAAGTTj1A 

»*)HH()HS)«i(**if*»»XX XX ***** ********************************* 

G GACTCGACGTTCACACCGTTCAC 
3' TSHB3 5' 

A C ACTG ACTACAGCGACTG TA C A C A CG AGGCTTGTCAAMCCAACTACTGCACCAAGCCA 

C AG AC ATTCTATCTGGGGG GATTTTCTGGTTAACTGTAATGGCAATG C AATCTG GTTAA 
A T G TG TTTACCTG G A ATAG A ACTA ATA AAATATC ATTG AT atg tct tgcctgc cattt 
a a t c ca taggcaca t ccacaaggcat t ago gage t tacacaact t tagaagcagaggcg 

3' 

EXONS 2 AND 3 ARE IN CAPITAL LETTERS 
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